SOME MEASURE-THEORETIC PROPERTIES OF
GENERALIZED MEANS 

IRINA NAVROTSKAYA AND PATRICK J. RABIER 


Abstract. If $A$ is a measure space, $u : A^m \to \mathbb{R}$ is a given function and $N \geq m$, the function

$U(x_1,\dots,x_N) = \binom{N}{m}^{-1} \sum_{1 \leq i_1 < \cdots < i_m \leq N} u(x_{i_1},\dots,x_{i_m})$

is called the generalized $N$-mean with kernel $u$, a terminology borrowed from $U$-statistics. Physical potentials for systems of particles are also defined by generalized means.

This paper investigates whether various measure-theoretic concepts for generalized $N$-means are equivalent to the analogous concepts for their kernels: a.e. convergence of sequences, measurability, essential boundedness and integrability with respect to absolutely continuous probability measures. The answer is often, but not always, positive. This information is crucial in some problems addressing the existence of generalized means satisfying given conditions, such as the classical Inverse Problem of statistical physics (in the canonical ensemble).

1. Introduction 

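Let $A$ be a set and let $g : A^m \to \mathbb{R}$ be a given function. If $N \geq m$ is an integer, the generalized $N$-mean of $g$ is the symmetric function $U : A^N \to \mathbb{R}$ defined by $U(x_1,\dots,x_N) := \frac{(N-m)!}{N!} \sum g(x_{i_1},\dots,x_{i_m})$, where the sum carries over all $m$-tuples $(i_1,\dots,i_m) \in \{1,\dots,N\}^m$ such that $i_j \neq i_k$ if $j \neq k$.

If $u(x_1,\dots,x_m) := \frac{1}{m!} \sum_{\sigma} g(x_{\sigma(1)},\dots,x_{\sigma(m)})$, where $\sigma$ runs over the permutations of $\{1,\dots,m\}$, then $u$ is symmetric and

(1.1) $U(x_1,\dots,x_N) = \binom{N}{m}^{-1} \sum_{1 \leq i_1 < \cdots < i_m \leq N} u(x_{i_1},\dots,x_{i_m})$.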

The function u is customarily called the kernel of U and m is the order (or degree) 
of U. Obviously, u = g if m = 1 but, unlike g, the kernel u is always uniquely 
determined by U, a point to which we shall return below. 
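The passage from $g$ to the kernel $u$ can be checked on a small example. The following is a minimal Python sketch, not part of the original text: the helper names and the sample function `g` are assumptions made for illustration, and the scaling factor $(N-m)!/N!$ for the sum over $m$-tuples of distinct indices is the one that makes that sum agree with (1.1).

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import comb, factorial

def mean_over_injective_tuples(g, m, xs):
    """(N-m)!/N! times the sum of g over all m-tuples of pairwise distinct indices."""
    N = len(xs)
    total = sum(g(*(xs[i] for i in idx)) for idx in permutations(range(N), m))
    return Fraction(total * factorial(N - m), factorial(N))

def symmetrize(g, m):
    """Kernel u(x_1..x_m) = (1/m!) * sum over permutations sigma of g(x_sigma(1)..x_sigma(m))."""
    def u(*x):
        return Fraction(sum(g(*p) for p in permutations(x)), factorial(m))
    return u

def generalized_mean(u, m, xs):
    """Right-hand side of (1.1) evaluated at the point xs."""
    N = len(xs)
    total = sum(u(*(xs[i] for i in idx)) for idx in combinations(range(N), m))
    return Fraction(total) / comb(N, m)

# hypothetical, deliberately non-symmetric g with m = 2, evaluated for N = 4
g = lambda a, b: a + 2 * b * b - a * b
xs = (1, 4, 2, 7)
lhs = mean_over_injective_tuples(g, 2, xs)
rhs = generalized_mean(symmetrize(g, 2), 2, xs)
```

Exact rational arithmetic is used so that the two sides can be compared with `==` rather than up to rounding.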

Generalized means lie at the foundation of $U$-statistics, a field extensively studied since its introduction by Hoeffding [1]. The nomenclature (generalized mean, kernel, order) is taken from it. In $U$-statistics, the kernel $u$ is given, the variables $x_1,\dots,x_N$ are replaced with random variables $X_1,\dots,X_N$ and the interest centers on the behavior of statistically relevant quantities as $N \to \infty$.

Physical potentials for systems of $N$ particles when the interaction of any $m \leq N$ particles is taken into account are also defined by generalized means (without the scaling factor, an immaterial difference). The variable $x_j$ captures the relevant information about the $j$-th particle (often more than just its space position, so that $A$ need not be euclidean space) and the energy $u(x_{i_1},\dots,x_{i_m})$ depends upon the interaction of the particles $i_1,\dots,i_m$. In some important questions, notably the

famous “Inverse Problem” of statistical physics m , i, H), the potential U is not 
given. Instead, the issue is precisely the existence of a potential $U$ of the form (1.1) satisfying given conditions.

For example, when the Inverse Problem is set up in the “canonical ensemble”, $N$ is fixed and $U$ should maximize a functional $T(V)$ (relative entropy) over a class of potentials $V$ of the form (1.1) characterized by various integrability conditions. Accordingly, the set $A$ is equipped with a measure $dx$ and a generalized $N$-mean $U$ of order $m$ is still defined by (1.1), but now with equality holding only a.e. on $A^N$ for the product measure. If a maximizing sequence $U_n$ is shown to have some type of limit $U$, is $U$ a generalized $N$-mean of order $m$? In the affirmative, what are the properties of the kernel $u$ of $U$ that can be inferred from the properties of $U$?

In this paper, we provide answers to the above queries which, apparently, cannot 
be found elsewhere. These answers play a key role in the recent work [7] by the first 
author and they should have value in the broad existence question for generalized 
means. The existence of extremal potentials is essential in particle physics (see for instance [5] for a variant of the Inverse Problem arising in coarse-grain modeling), but it is usually assumed, if not taken for granted. More generally, there are numerous
examples borrowing from information theory when generalized means are sought 
that maximize some kind of entropy. Even though the maximization involves only 
a finite number of parameters in many of these examples, it should be expected 
that some of the more complex models require a measure-theoretic setting. 

From now on, the measure $dx$ on $A$ is $\sigma$-finite and complete and, to avoid trivialities, $A$ has strictly positive $dx$ measure. The $\sigma$-finiteness assumption ensures that the product measure $dx^{\otimes k}$ on $A^k$ and its completion $d^k x$ are defined and that the classical theorems (Fubini, Tonelli) are applicable. The terminology “a.e.”, “null set” or “co-null subset” (complement of a null set) always refers to the measure $d^k x$ or its measurable subsets. Since $d^k x$ is complete and $A^k$ has positive $d^k x$ measure, a co-null subset of $A^k$ is always measurable and nonempty.

The symmetry of $u$ or $U$ is not needed in any of our results and will not be assumed. Accordingly, the “generalized mean operator” $G_{m,N}$ given by $G_{m,N}(u) := U$ is well defined (Remark 2.1) and linear on the space of all a.e. finite functions on $A^m$. The exposition is confined to real-valued functions, but everything can readily be extended to the vector-valued case.

The recovery of the kernel $u$ from the generalized mean $U$ will play a crucial role. There are elementary ways to proceed, which may explain why no general procedure seems to be on record. (Tenth addresses the completely different problem of finding $u$ when $U = U_N$ in (1.1) is known for every $N$ but the order $m$ is unknown.) However, the problem becomes much more subtle after noticing that the most natural formulas for the kernel are useless for measure-theoretic purposes. Indeed, after suitable modifications of $u$ and $U$ on null sets, it may be assumed that (1.1) holds pointwise. Then, if $m = 1$, we get $u(x) = U(x,\dots,x)$ but, of course, this does not show whether the measurability of $U$ implies the measurability of $u$. When $m > 1$, other simple formulas for the kernel, for instance


major difficulty; in the “grand canonical ensemble”, N is free and the problem was solved 
by Chayes and Chayes [I]. 
$u(x_1,x_2) = \frac{N}{2} U(x_1,x_2,\dots,x_2) - \frac{N-2}{2} U(x_2,\dots,x_2)$ when $m = 2$, suffer from similar

shortcomings.
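When (1.1) holds pointwise, the formula for $m = 2$, namely $u(x_1,x_2) = \frac{N}{2}U(x_1,x_2,\dots,x_2) - \frac{N-2}{2}U(x_2,\dots,x_2)$, can be confirmed numerically. The sketch below is only an illustration under that pointwise assumption; the function names and the sample kernel are hypothetical. It evaluates $U$ on sets of points with repeated coordinates, which is precisely why such a formula carries no measure-theoretic information.

```python
from fractions import Fraction
from itertools import combinations
from math import comb

def generalized_mean(u, m, xs):
    """Right-hand side of (1.1) evaluated at the point xs."""
    N = len(xs)
    total = sum(Fraction(u(*(xs[i] for i in idx))) for idx in combinations(range(N), m))
    return total / comb(N, m)

def kernel_from_diagonal(U, N, x1, x2):
    """u(x1, x2) = (N/2) U(x1, x2, ..., x2) - ((N-2)/2) U(x2, ..., x2)."""
    return (Fraction(N, 2) * U(x1, *([x2] * (N - 1)))
            - Fraction(N - 2, 2) * U(*([x2] * N)))

# hypothetical non-symmetric kernel of order 2, with N = 5
u = lambda a, b: 3 * a - b * b + a * b
N = 5
U = lambda *xs: generalized_mean(u, 2, xs)
recovered = kernel_from_diagonal(U, N, 2, 7)
```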

More sophisticated representations of $u$ in terms of $U$ will be needed to prove that, indeed, $u$ is measurable whenever $U$ is measurable (Theorem 3.1). These representations call for the explicit introduction of the “kernel operator” $K_{m,N}$ inverse of $G_{m,N}$, i.e., $K_{m,N}(U) = u$. We shall show that, when $N > m \geq 2$, $K_{m,N}(U)$ can be recovered from $U$ and the four operators $K_{m,N-1}$, $K_{m-1,N}$, $G_{N-1,N}$ and $G_{m-1,N-1}$ (see (3.3) and also (3.1) when $m = 1$; that $K_{m,m} = I$ is trivial). It follows that many properties of the kernels can be established by transfinite induction on the pairs $(m,N)$ with $m \leq N$, totally ordered by $(m,N) < (m',N')$ if $m < m'$ or $m = m'$ and $N < N'$. This method is systematically used throughout the paper.

There are also unexpected differences between pointwise and a.e. limits of generalized $N$-means. Suppose once again that $m = 1$ and that $U_n(x_1,\dots,x_N) = N^{-1} \sum_{i=1}^{N} u_n(x_i)$ for every $(x_1,\dots,x_N) \in A^N$, so that $u_n(x) = U_n(x,\dots,x)$. If the sequence $U_n$ has a pointwise limit $U$, then $u_n$ has the pointwise limit $u(x) = U(x,\dots,x)$. This is true irrespective of whether $U$ achieves infinite values on $A^N$. In contrast, if it is only assumed that $U_n$ has an a.e. limit $U$, the sequence $u_n(x)$ may have no limit for any $x \in A$; see Example 2.1. Nonetheless, we shall see in the next section that if $U$ is a.e. finite, then $u_n(x)$ converges for a.e. $x \in A$ (Theorem 2.3). Thus, the finiteness of the limit $U$, irrelevant when pointwise convergence is assumed, makes a crucial difference when only a.e. convergence holds.

Measurability and essential boundedness are discussed in Section 3. Section 4 is devoted to more delicate integrability issues. The general problem is as follows: A symmetric probability density $P$ on $A^N$ induces a natural symmetric probability density $P_{(m)}$ on $A^m$ with $m \leq N$ upon integrating $P$ with respect to any set of $N - m$ variables. Is it true that a generalized $N$-mean $U$ of order $m$ is in $L^r(A^N; P d^N x)$ if and only if its kernel $u$ is in $L^r(A^m; P_{(m)} d^m x)$?

Assuming $P > 0$ a.e., the answer is positive when $r = \infty$, but the necessity may be false if $r < \infty$ (Example 4.1). This has immediate and important consequences in existence questions. In a nutshell, the problem of finding a generalized $N$-mean in $L^r(A^N; P d^N x)$ is not always reducible to the problem of finding its kernel in $L^r(A^m; P_{(m)} d^m x)$ and the former may have solutions when the latter does not.

The next step is to investigate whether conditions on $P$ ensure that the above discrepancies do not occur. Such a condition is given in Theorem 4.5. It always holds in some arbitrarily small perturbations of any symmetric probability density (Theorem 4.7) and, when $A$ has finite $dx$ measure, it is even generic (in the sense of Baire category) among bounded symmetric probability densities (Theorem 4.8). This supports the idea that, while probably not necessary, the condition in question is sharp. Yet, it is instructive that it fails when $P d^N x$ is the probability that $N$ particles have given coordinates, under the natural assumption that $P = 0$ when two coordinates are equal. If so, the existence of potentials $U = G_{m,N}(u) \in L^r(A^N; P d^N x)$ with $u \notin L^r(A^m; P_{(m)} d^m x)$ cannot be ruled out.

Our approach also provides good guidelines for the treatment of the more general problem when (1.1) is a weighted mean, but since there are significant new technicalities, exceptional cases, etc., this problem is not discussed.


^If m > 1 , the finiteness of U is not entirely irrelevant to the pointwise convergence issue, 
which however remains markedly different from a.e. convergence. 
2. Almost everywhere convergence

In this section, we prove that a sequence $U_n$ of generalized $N$-means of order $m$ has an a.e. finite limit $U$ if and only if the corresponding sequence $u_n$ of kernels has an a.e. finite limit $u$. The next example shows that the “only if” part is false if $U$ is infinite.

Example 2.1. With $A = [0,1]$ and $dx$ the Lebesgue measure, let $f_n$ denote the well-known sequence of characteristic functions of subintervals $J_n$ with $|J_n| \to 0$, such that, when $x$ is fixed, $f_n(x)$ assumes both the values $0$ and $1$ for arbitrarily large $n$ [m p. 94]. If $u_n := 2n(1 - f_n)$, then $u_n = 0$ on $J_n$ and $u_n = 2n$ otherwise. Also, $u_n(x)$ has no limit for any $x$ since $u_n(x)$ assumes both the values $0$ and $2n$ for arbitrarily large $n$. On the other hand, $U_n := G_{1,2}(u_n)$ is $0$ on $J_n \times J_n$ and either $n$ or $2n$ otherwise. As a result, $U_n$ tends to $\infty$ off the diagonal of $[0,1]^2$ (a set of $d^2 x$ measure $0$) since $(x_1,x_2) \notin J_n \times J_n$ if $x_1 \neq x_2$ and $n$ is large enough. Thus, $U_n$ has an a.e. limit but $u_n$ does not.
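The behavior described in Example 2.1 can be observed numerically. The sketch below assumes one common concrete choice for the intervals $J_n$, the dyadic "typewriter" enumeration $J_n = [j 2^{-k}, (j+1) 2^{-k}]$ with $n = 2^k + j$; this indexing is an assumption for illustration and need not coincide with the one in the cited reference.

```python
def interval(n):
    """J_n = [j / 2**k, (j + 1) / 2**k] where n = 2**k + j and 0 <= j < 2**k."""
    k = n.bit_length() - 1
    j = n - (1 << k)
    return j / 2**k, (j + 1) / 2**k

def f(n, x):
    """Characteristic function of J_n."""
    lo, hi = interval(n)
    return 1 if lo <= x <= hi else 0

def u(n, x):
    """Kernel u_n = 2n(1 - f_n)."""
    return 2 * n * (1 - f(n, x))

def U(n, x1, x2):
    """U_n = G_{1,2}(u_n), the plain average of u_n(x1) and u_n(x2)."""
    return (u(n, x1) + u(n, x2)) / 2

# Along one dyadic level, u_n(0.3) is 0 for one index and 2n for the others.
kernel_is_zero = {u(n, 0.3) == 0 for n in range(64, 128)}

# Off the diagonal, U_n >= n as soon as |J_n| < |x1 - x2|.
mean_values = [U(n, 0.3, 0.7) for n in range(4, 200)]
```

The same pattern repeats on every dyadic level, which is why $u_n(x)$ keeps returning to $0$ while $U_n(x_1,x_2)$ grows without bound for $x_1 \neq x_2$.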

If $U_n = G_{m,N}(u_n)$, then (a) $u_n$ and $U_n$ can be modified on $n$-independent null sets of $A^m$ and $A^N$, respectively, in such a way that $u_n$ is everywhere finite and that $U_n$ is given by (1.1) for every $(x_1,\dots,x_N) \in A^N$ (so that $U_n$ is everywhere finite; see also Remark 2.1 below). On the other hand, (b) if $U_n \to U$ a.e., $U_n$ and $U$ can be modified on null sets independent of $n$ in such a way that convergence holds everywhere. However, (a) and (b) cannot be achieved simultaneously. Indeed, a modification of the left-hand side of (1.1) on a null set of $A^N$ need not preserve the sum structure of the right-hand side, whereas such a modification may be needed to ensure that $U_n \to U$ pointwise. Thus, it is generally not possible to assume both pointwise convergence and pointwise sum structure of $U_n$. In fact, if this were always possible, the sequence $u_n(x)$ of Example 2.1 would have the a.e. limit $U(x,x)$, which is false for every $x$.

We begin with a simple lemma. 

Lemma 2.1. Let $1 \leq k \leq N$ be integers and let $T_k$ be a co-null set of $A^k$. Then, the set $T_N := \{(x_1,\dots,x_N) \in A^N : (x_{j_1},\dots,x_{j_k}) \in T_k$ for every $1 \leq j_1 < \cdots < j_k \leq N\}$ is co-null in $A^N$.

Proof. As a preamble, note that the product $S \times T$ of two co-null sets $S$ and $T$ in $A^j$ and $A^\ell$, respectively, is co-null in the product $A^{j+\ell}$ since its complement $((A^j \setminus S) \times A^\ell) \cup (A^j \times (A^\ell \setminus T))$ is a null set in $A^{j+\ell}$. This feature is immediately extended to any finite product of co-null sets. Now, $T_N$ is the intersection of the sets $\{(x_1,\dots,x_N) \in A^N : (x_{j_1},\dots,x_{j_k}) \in T_k\}$ for fixed $1 \leq j_1 < \cdots < j_k \leq N$. Up to a permutation of the coordinates, such a set is the product of $N - k$ copies of $A$ and one copy of $T_k$ and therefore co-null in $A^N$ from the above. Thus, $T_N$ is co-null. □

Remark 2.1. It follows at once from Lemma 2.1 that an a.e. modification of $u$ creates only an a.e. modification of $G_{m,N}(u)$.

The next lemma is the special case $m = 1$ of Theorem 2.3 below.

Lemma 2.2. Let $U_n$ be a sequence of generalized $N$-means of order $1$ and let $u_n$ denote the corresponding sequence of kernels (i.e., $U_n = G_{1,N}(u_n)$). Then, there is an a.e. finite function $U$ on $A^N$ such that $U_n \to U$ a.e. if and only if there is an a.e. finite function $u$ on $A$ such that $u_n \to u$ a.e. on $A$. If so, $U = G_{1,N}(u)$.
Proof. Assume first that $u$ exists. With no loss of generality, we may also assume that $u_n$ and $u$ are everywhere defined and that $u_n \to u$ pointwise on $A$. Then, $U_n \to G_{1,N}(u)$ pointwise.

Conversely, assume that $U$ exists. With no loss of generality, we may also assume that $u_n$ and $U$ are everywhere finite and that $U_n(x_1,\dots,x_N) = N^{-1} \sum_{i=1}^{N} u_n(x_i) \in \mathbb{R}$ for every $(x_1,\dots,x_N) \in A^N$. By hypothesis,

(2.1) $U_n(x_1,\dots,x_N) \to U(x_1,\dots,x_N)$,

for a.e. $(x_1,\dots,x_N) \in A^N$, but recall that it cannot be assumed that $U_n \to U$ pointwise; see the comments after Example 2.1.

Let $E \subset A^N$ denote the co-null set on which (2.1) holds. There is a co-null set $S_{N-1}$ of $A^{N-1}$ such that, for every $(\tilde{x}_2,\dots,\tilde{x}_N) \in S_{N-1}$, the subset $E_{\tilde{x}_2,\dots,\tilde{x}_N} := \{x \in A : (x,\tilde{x}_2,\dots,\tilde{x}_N) \in E\}$ is co-null in $A$. This is merely a rephrasing of the fact that a.e. $(\tilde{x}_2,\dots,\tilde{x}_N)$-section of the complement of $E$ is a null set of $A$.

From now on, $(\tilde{x}_2,\dots,\tilde{x}_N) \in S_{N-1}$ is chosen once and for all. By definition of $S_{N-1}$, if $x \in E_{\tilde{x}_2,\dots,\tilde{x}_N}$, then

(2.2) $u_n(x) + u_n(\tilde{x}_2) + \cdots + u_n(\tilde{x}_N) \to N U(x,\tilde{x}_2,\dots,\tilde{x}_N)$.

Thus, if $(x_1,\dots,x_N) \in (E_{\tilde{x}_2,\dots,\tilde{x}_N})^N$, (2.2) implies

$u_n(x_j) + u_n(\tilde{x}_2) + \cdots + u_n(\tilde{x}_N) \to N U(x_j,\tilde{x}_2,\dots,\tilde{x}_N)$, $1 \leq j \leq N$,

and so, by addition,

(2.3) $u_n(x_1) + \cdots + u_n(x_N) + N(u_n(\tilde{x}_2) + \cdots + u_n(\tilde{x}_N)) \to N l(x_1,\dots,x_N)$,

where $l(x_1,\dots,x_N) := \sum_{j=1}^{N} U(x_j,\tilde{x}_2,\dots,\tilde{x}_N)$, i.e.,

(2.4) $l = N G_{1,N}(U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N))$.

Since both $E$ and $(E_{\tilde{x}_2,\dots,\tilde{x}_N})^N$ are co-null in $A^N$, their intersection $E \cap (E_{\tilde{x}_2,\dots,\tilde{x}_N})^N$ is co-null. Let $(x_1,\dots,x_N) \in E \cap (E_{\tilde{x}_2,\dots,\tilde{x}_N})^N$ be chosen once and for all, so that (2.1) and (2.3) hold. This implies that $u_n(\tilde{x}_2) + \cdots + u_n(\tilde{x}_N)$ has a finite limit, namely $l(x_1,\dots,x_N) - U(x_1,\dots,x_N)$. Then, by (2.2) and (2.4), $u_n(x)$ tends to the finite limit

(2.5) $u(x) := N U(x,\tilde{x}_2,\dots,\tilde{x}_N) + U(x_1,\dots,x_N) - N G_{1,N}(U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N))(x_1,\dots,x_N)$,

for every $x \in E_{\tilde{x}_2,\dots,\tilde{x}_N}$. Thus, $u_n \to u$ a.e. on $A$ and so, by the first part of the proof, $U = G_{1,N}(u)$. □

Theorem 2.3. Let $U_n$ be a sequence of generalized $N$-means of order $m \geq 1$ and let $u_n$ denote the corresponding sequence of kernels (i.e., $U_n = G_{m,N}(u_n)$). Then, there is an a.e. finite function $U$ on $A^N$ such that $U_n \to U$ a.e. if and only if there is an a.e. finite function $u$ on $A^m$ such that $u_n \to u$ a.e. on $A^m$. If so, $U = G_{m,N}(u)$.

Proof. As in the proof of Lemma 2.2 when $m = 1$, the sufficiency is straightforward. We now assume that $U$ exists. With no loss of generality, we may also assume that $u_n$ and $U$ are everywhere finite and that

(2.6) $U_n(x_1,\dots,x_N) = \binom{N}{m}^{-1} \sum_{1 \leq i_1 < \cdots < i_m \leq N} u_n(x_{i_1},\dots,x_{i_m})$

for every $(x_1,\dots,x_N) \in A^N$. By hypothesis,

(2.7) $U_n(x_1,\dots,x_N) \to U(x_1,\dots,x_N)$,

for a.e. $(x_1,\dots,x_N) \in A^N$. Once again, it cannot also be assumed that $U_n \to U$ pointwise.

The proof that $u_n$ has an a.e. finite limit $u$ will proceed by transfinite induction on the pairs $(m,N)$ with $N \geq m$, totally ordered by $(m,N) < (m',N')$ if $m < m'$ or if $m = m'$ and $N < N'$. For the pairs $(m,m)$ (with no predecessor), this must be proved directly, but is trivial since $U_n = u_n$ in this case. If $m = 1$, the result follows from Lemma 2.2 irrespective of $N$. Accordingly, we assume $m \geq 2$ and that the result is true for generalized $N$-means of order $m - 1$ with $N \geq m - 1$ (hypothesis of induction on $m$, which is satisfied when $m = 2$) and for generalized $(N-1)$-means of order $m$ with $N \geq m + 1$ (hypothesis of induction on $N$) and show that it remains true for $U_n$ above.

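Let $E \subset A^N$ denote the co-null set on which (2.7) holds. There is a co-null set $S$ of $A$ such that, for every $\tilde{x}_N \in S$, the subset $E_{\tilde{x}_N} := \{(x_1,\dots,x_{N-1}) \in A^{N-1} : (x_1,\dots,x_{N-1},\tilde{x}_N) \in E\}$ is co-null in $A^{N-1}$. Let $\tilde{x}_N \in S$ be chosen once and for all. If $(x_1,\dots,x_{N-1}) \in E_{\tilde{x}_N}$ then, by splitting the cases when $i_m \leq N - 1$ and $i_m = N$ in the right-hand side of (2.6), we may rewrite (2.7) with $x_N = \tilde{x}_N$ as

(2.8) $\sum_{1 \leq i_1 < \cdots < i_m \leq N-1} u_n(x_{i_1},\dots,x_{i_m}) + \sum_{1 \leq i_1 < \cdots < i_{m-1} \leq N-1} u_n(x_{i_1},\dots,x_{i_{m-1}},\tilde{x}_N) \to \binom{N}{m} U(x_1,\dots,x_{N-1},\tilde{x}_N)$.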
By Lemma 2.1, the set $\Omega := \{(x_1,\dots,x_N) \in A^N : (x_{j_1},\dots,x_{j_{N-1}}) \in E_{\tilde{x}_N}$ for every $1 \leq j_1 < \cdots < j_{N-1} \leq N\}$ is co-null in $A^N$ and, if $(x_1,\dots,x_N) \in \Omega$, then (2.8) holds with $(x_1,\dots,x_{N-1})$ replaced with $(x_{j_1},\dots,x_{j_{N-1}})$ and $1 \leq j_1 < \cdots < j_{N-1} \leq N$ fixed. If so, the $m$-tuple $(x_{i_1},\dots,x_{i_m})$ in the left-hand side of (2.8) becomes $(x_{j_{i_1}},\dots,x_{j_{i_m}})$. Since $j_1 < \cdots < j_{N-1}$ and $i_1 < \cdots < i_m$, it follows that $j_{i_1} < \cdots < j_{i_m}$. We do not write down the corresponding formula (2.8) for this case, because there is a simpler (and otherwise crucial) way to formulate it without involving sub-subscripts, as detailed below.

There are only $N$ different ways to pick $N - 1$ variables $x_{j_1},\dots,x_{j_{N-1}}$ with increasing indices among the $N$ variables $x_1,\dots,x_N$, that is, by omitting a different variable $x_k$, $1 \leq k \leq N$. For every such $k$, the variables $x_{j_1},\dots,x_{j_{N-1}}$ are then $x_1,\dots,\widehat{x_k},\dots,x_N$ where, as is customary, $\widehat{x_k}$ means that $x_k$ is omitted. Thus, as $i_1,\dots,i_m$ run over all the indices such that $1 \leq i_1 < \cdots < i_m \leq N - 1$, the corresponding $m$-tuples $(x_{j_{i_1}},\dots,x_{j_{i_m}})$ run over all the $m$-tuples $(x_{\ell_1},\dots,x_{\ell_m})$ in which the variable $x_k$ does not appear, that is, over the $m$-tuples $(x_{\ell_1},\dots,x_{\ell_m})$ with $1 \leq \ell_1 < \cdots < \ell_m \leq N$ and $\ell_1 \neq k,\dots,\ell_m \neq k$.
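In light of the above, when $(x_1,\dots,x_{N-1})$ is replaced with $(x_{j_1},\dots,x_{j_{N-1}})$ in (2.8) and the variable $x_k$ does not appear, the formula (2.8) takes the form

(2.9) $\sum_{1 \leq \ell_1 < \cdots < \ell_m \leq N,\ \ell_1 \neq k,\dots,\ell_m \neq k} u_n(x_{\ell_1},\dots,x_{\ell_m}) + \sum_{1 \leq \ell_1 < \cdots < \ell_{m-1} \leq N,\ \ell_1 \neq k,\dots,\ell_{m-1} \neq k} u_n(x_{\ell_1},\dots,x_{\ell_{m-1}},\tilde{x}_N) \to \binom{N}{m} U(x_1,\dots,\widehat{x_k},\dots,x_N,\tilde{x}_N)$,

where $(x_1,\dots,\widehat{x_k},\dots,x_N)$ consists of $N - 1$ variables.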


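Recall that (2.9) holds for every $1 \leq k \leq N$ and every $(x_1,\dots,x_N) \in \Omega$. By adding up the relations (2.9) for $k = 1,\dots,N$, we get (after replacing $\ell_1,\dots,\ell_m$ with $i_1,\dots,i_m$)

(2.10) $\sum_{k=1}^{N} \sum_{1 \leq i_1 < \cdots < i_m \leq N,\ i_1 \neq k,\dots,i_m \neq k} u_n(x_{i_1},\dots,x_{i_m}) + \sum_{k=1}^{N} \sum_{1 \leq i_1 < \cdots < i_{m-1} \leq N,\ i_1 \neq k,\dots,i_{m-1} \neq k} u_n(x_{i_1},\dots,x_{i_{m-1}},\tilde{x}_N) \to \binom{N}{m} l(x_1,\dots,x_N)$,

for every $(x_1,\dots,x_N) \in \Omega$, where $l(x_1,\dots,x_N) := \sum_{k=1}^{N} U(x_1,\dots,\widehat{x_k},\dots,x_N,\tilde{x}_N)$, i.e.,

(2.11) $l = N G_{N-1,N}(U(\cdot,\tilde{x}_N))$.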


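Let $1 \leq i_1 < \cdots < i_m \leq N$ be fixed. In the first double sum of (2.10), $u_n(x_{i_1},\dots,x_{i_m})$ appears exactly once for every index $1 \leq k \leq N$ such that $k \neq i_1,\dots,k \neq i_m$. Evidently, there are $N - m$ such indices $k$ and so

$\sum_{k=1}^{N} \sum_{1 \leq i_1 < \cdots < i_m \leq N,\ i_1 \neq k,\dots,i_m \neq k} u_n(x_{i_1},\dots,x_{i_m}) = (N-m) \sum_{1 \leq i_1 < \cdots < i_m \leq N} u_n(x_{i_1},\dots,x_{i_m}) = (N-m) \binom{N}{m} U_n(x_1,\dots,x_N)$.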
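The second double sum in (2.10) has the same structure as the first one, with $m$ replaced with $m - 1$. Thus,

$\sum_{k=1}^{N} \sum_{1 \leq i_1 < \cdots < i_{m-1} \leq N,\ i_1 \neq k,\dots,i_{m-1} \neq k} u_n(x_{i_1},\dots,x_{i_{m-1}},\tilde{x}_N) = (N-m+1) \sum_{1 \leq i_1 < \cdots < i_{m-1} \leq N} u_n(x_{i_1},\dots,x_{i_{m-1}},\tilde{x}_N) = (N-m+1) \binom{N}{m-1} G_{m-1,N}(u_n(\cdot,\tilde{x}_N))(x_1,\dots,x_N) = m \binom{N}{m} G_{m-1,N}(u_n(\cdot,\tilde{x}_N))(x_1,\dots,x_N)$.

Hence, (2.10) boils down to (recall (2.11))

$(N-m) U_n + m G_{m-1,N}(u_n(\cdot,\tilde{x}_N)) \to N G_{N-1,N}(U(\cdot,\tilde{x}_N))$ on $\Omega$.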

By (2.7), $U_n \to U$ on $E$. Thus,

(2.12) $G_{m-1,N}(u_n(\cdot,\tilde{x}_N)) \to m^{-1} (N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U)$ on $\Omega \cap E$.

Since $\Omega \cap E$ is co-null in $A^N$, this means that the sequence $G_{m-1,N}(u_n(\cdot,\tilde{x}_N))$ has an a.e. finite limit. Therefore, the hypothesis of induction on $m$ ensures that there is an a.e. finite function $v$ on $A^{m-1}$ such that $u_n(\cdot,\tilde{x}_N) \to v$ a.e. on $A^{m-1}$. Thus, $G_{m-1,N-1}(u_n(\cdot,\tilde{x}_N)) \to G_{m-1,N-1}(v)$ on some co-null subset $T_{N-1}$ of $A^{N-1}$ and then, from (2.8),

(2.13) $G_{m,N-1}(u_n) \to \frac{N}{N-m} U(\cdot,\tilde{x}_N) - \frac{m}{N-m} G_{m-1,N-1}(v)$ on $T_{N-1} \cap E_{\tilde{x}_N}$.

Since $T_{N-1} \cap E_{\tilde{x}_N}$ is co-null in $A^{N-1}$, this shows that $G_{m,N-1}(u_n)$ has an a.e. finite limit. That $u_n$ has an a.e. finite limit $u$ thus follows from the hypothesis of induction on $N$ and so, by the first part of the proof, $U = G_{m,N}(u)$. □
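When the sequence is constant and (1.1) holds pointwise, the limit relation preceding (2.12) becomes the identity $(N-m)U + m\,G_{m-1,N}(u(\cdot,\tilde{x}_N)) = N\,G_{N-1,N}(U(\cdot,\tilde{x}_N))$, which can be tested on a finite set. The following sketch is an illustration only: the set `A`, the kernel `u` and the frozen point `xt` (standing for $\tilde{x}_N$) are hypothetical choices, and the kernel is deliberately not symmetric since symmetry is never assumed.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb

def G(m, N, f):
    """Generalized N-mean of order m with kernel f, as in (1.1)."""
    def F(*xs):
        assert len(xs) == N
        total = sum(Fraction(f(*(xs[i] for i in idx)))
                    for idx in combinations(range(N), m))
        return total / comb(N, m)
    return F

m, N = 2, 4
A = (0, 1, 2)
u = lambda a, b: 3 * a - b * b + 2 * a * b + 1    # not symmetric on purpose
U = G(m, N, u)
xt = 2                                            # frozen last variable

def residual(xs):
    """Left-hand side minus right-hand side of the identity at the point xs."""
    lhs = (N - m) * U(*xs) + m * G(m - 1, N, lambda *y: u(*y, xt))(*xs)
    rhs = N * G(N - 1, N, lambda *y: U(*y, xt))(*xs)
    return lhs - rhs

max_abs_residual = max(abs(residual(xs)) for xs in product(A, repeat=N))
```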

The following corollary gives a short rigorous proof of an otherwise intuitively 
clear property. 

Corollary 2.4. A generalized N-mean U of order m has a unique kernel u (up to 
modifications on a null set). 

Proof. If $u$ and $v$ are two kernels of $U$, set $u_n = u$ if $n$ is odd and $u_n = v$ if $n$ is even. Then, $G_{m,N}(u_n) = U$ a.e. for every $n$. By Theorem 2.3, $u_n$ converges a.e., which implies $u = v$ a.e. □

Significant differences between $m = 1$ and $m > 1$ are worth pointing out (without proof, for brevity). When $m = 1$, a stronger form of Lemma 2.2 holds: If $G_{1,N}(u_n) \to U$ a.e. with $U$ finite on a subset of $A^N$ of positive $d^N x$ measure, then $u_n \to u$ a.e. and $u$ is finite on a subset of $A$ of positive $dx$ measure (the converse is clearly false). There is no such improvement of Theorem 2.3 if $m > 1$. It is not hard to find examples (variants of Example 2.1) when $G_{m,N}(u_n)$ has an a.e. limit which is finite on a subset of $A^N$ of positive $d^N x$ measure, but $u_n$ has no limit at every point of a subset of $A^m$ of positive $d^m x$ measure.












3. Kernel recovery and measurability 

As pointed out in the Introduction, there are rather simple ways to recover $u$ from $G_{m,N}(u)$, but the resulting formulas are inadequate to answer even the most basic measure-theoretic questions. Below, we describe a recovery procedure that preserves all the relevant measure-theoretic information.

The fact that the kernel of a generalized $N$-mean of order $m$ is unique (Corollary 2.4) means that $G_{m,N}$ has an inverse $K_{m,N}$ (“kernel operator”), i.e., $K_{m,N}(U) = u$. By the linearity of $G_{m,N}$, it follows that $K_{m,N}$ (defined on the space of generalized $N$-means of order $m$) is also linear.

Explicit formulas for $K_{1,N}$ and $K_{m,m}$ will be given but, when $m \geq 2$ and $N \geq m + 1$, $K_{m,N}$ will only be defined inductively, in terms of $K_{m-1,N}$ and $K_{m,N-1}$ (and also $G_{m-1,N-1}$ and $G_{N-1,N}$). This makes it possible, by transfinite induction (as in the proof of Theorem 2.3), to recover $K_{m,N}$ for arbitrary $m$ and $N \geq m$.

First, $G_{m,m} = K_{m,m} = I$ is obvious. To define $K_{1,N}$, we return to the proof of Lemma 2.2 in the case when the sequences $u_n$ and $U_n$ are constant and therefore equal to their a.e. limits $u$ and $U$, respectively. If so, (2.5) yields the kernel of $U$:

(3.1) $K_{1,N}(U) = N U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N) + U(x_1,\dots,x_N) - N G_{1,N}(U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N))(x_1,\dots,x_N)$,

where $(\tilde{x}_2,\dots,\tilde{x}_N)$ is arbitrarily chosen in some suitable co-null subset of $A^{N-1}$ and $(x_1,\dots,x_N)$ is arbitrarily chosen in some suitable co-null subset of $A^N$. (Of course, these co-null subsets depend on $U$.)

Note that (3.1) immediately shows that $K_{1,N}(U)$ is measurable if $U$ is measurable since it is always possible to choose $(\tilde{x}_2,\dots,\tilde{x}_N)$ such that $U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N)$ is measurable (the extra term $U(x_1,\dots,x_N) - N G_{1,N}(U(\cdot,\tilde{x}_2,\dots,\tilde{x}_N))(x_1,\dots,x_N)$ in (3.1) is just a constant).
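Formula (3.1) can be tried out when everything is defined pointwise. In the sketch below, the names `mean1` and `recover_kernel`, the sample kernel and the two frozen tuples `xt` (for $(\tilde{x}_2,\dots,\tilde{x}_N)$) and `xs` (for $(x_1,\dots,x_N)$) are assumptions made for illustration; in the pointwise setting any choice of these tuples works.

```python
from fractions import Fraction

def mean1(u, xs):
    """G_{1,N}(u)(x_1, ..., x_N) = (1/N) * sum_i u(x_i)."""
    return Fraction(sum(u(x) for x in xs), len(xs))

def recover_kernel(U, N, xt, xs):
    """Right-hand side of (3.1); xt has N - 1 entries and xs has N entries."""
    # N * G_{1,N}(U(., xt))(xs) is the plain sum of U(x_j, xt) over j.
    const = U(*xs) - sum(U(xj, *xt) for xj in xs)
    return lambda x: N * U(x, *xt) + const

# hypothetical kernel of order 1 and N = 4
u = lambda x: x**3 - 5 * x + 2
N = 4
U = lambda *xs: mean1(u, xs)
k = recover_kernel(U, N, xt=(1, -2, 3), xs=(0, 5, -1, 2))
```

Only the term `N * U(x, *xt)` depends on `x`, which matches the remark that the rest of (3.1) is a constant.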

To define $K_{m,N}$ in terms of $K_{m-1,N}$ and $K_{m,N-1}$ when $m \geq 2$ and $N \geq m + 1$, we return to the proof of Theorem 2.3 when the sequences $u_n$ and $U_n$ are constant and therefore equal to their a.e. limits $u$ and $U$, respectively. If so, $u_n = u$ and (2.12) is the a.e. equality

$G_{m-1,N}(u(\cdot,\tilde{x}_N)) = m^{-1} (N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U)$,

where $\tilde{x}_N$ is arbitrarily chosen in some suitable co-null subset of $A$. This shows that $N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U$ is a generalized $N$-mean of order $m - 1$ and that

(3.2) $u(\cdot,\tilde{x}_N) = m^{-1} K_{m-1,N}(N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U)$.

Next, (2.13) is the a.e. equality (since $v = u(\cdot,\tilde{x}_N)$ in (2.13))

$G_{m,N-1}(u) = \frac{N}{N-m} U(\cdot,\tilde{x}_N) - \frac{m}{N-m} G_{m-1,N-1}(u(\cdot,\tilde{x}_N))$.

This shows that $N U(\cdot,\tilde{x}_N) - m G_{m-1,N-1}(u(\cdot,\tilde{x}_N))$ is a generalized $(N-1)$-mean of order $m$ and, by (3.2), that $K_{m,N}(U) = u$ is given by (when $2 \leq m \leq N - 1$)

(3.3) $(N-m) K_{m,N}(U) = K_{m,N-1}(N U(\cdot,\tilde{x}_N) - G_{m-1,N-1}(K_{m-1,N}(N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U)))$.
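The inductive definition of $K_{m,N}$ through (3.1) and (3.3) translates directly into a recursion. The sketch below is one possible transcription for functions on a finite set where (1.1) holds pointwise; the function names, the base point `a` (used for every frozen variable) and the sample kernel are assumptions for illustration, and no attempt is made at efficiency.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb

def G(m, N, f):
    """Generalized N-mean of order m with kernel f, as in (1.1)."""
    def F(*xs):
        total = sum(Fraction(f(*(xs[i] for i in idx)))
                    for idx in combinations(range(N), m))
        return total / comb(N, m)
    return F

def K(m, N, U, a):
    """Kernel of the generalized N-mean U of order m; a is the frozen base point."""
    if m == N:
        return U                                   # K_{m,m} = I
    if m == 1:                                     # formula (3.1), all frozen points equal to a
        const = U(*([a] * N)) - N * U(*([a] * N))
        return lambda x: N * U(x, *([a] * (N - 1))) + const
    Ut = lambda *y: U(*y, a)                       # U(., a), a function of N - 1 variables
    W = lambda *y: N * G(N - 1, N, Ut)(*y) - (N - m) * U(*y)
    k1 = K(m - 1, N, W, a)                         # equals m * u(., a), see (3.2)
    V = lambda *y: N * Ut(*y) - G(m - 1, N - 1, k1)(*y)
    k2 = K(m, N - 1, V, a)                         # equals (N - m) * u, see (3.3)
    return lambda *z: k2(*z) / (N - m)

# hypothetical non-symmetric kernel of order 2, recovered from its 4-mean
A = (0, 1, 2)
u = lambda x, y: 3 * x - y * y + 2 * x * y + 1
U = G(2, 4, u)
k = K(2, 4, U, a=1)
recovered = {(x, y): k(x, y) for x, y in product(A, repeat=2)}
```

The recursion only ever calls `K` with a smaller order or a smaller number of variables, mirroring the induction on the pairs $(m,N)$.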

If an a.e. defined function on $A^k$ is said to be symmetric if it coincides a.e. with an everywhere defined symmetric function, then a generalized $N$-mean is symmetric if and only if its kernel is symmetric. Indeed, it is obvious that the operators $G_{m,N}$ preserve symmetry. Conversely, since $K_{m,m} = I$ and $K_{1,N}$ (obviously) preserve symmetry, $K_{m,N}$ preserves symmetry by (3.3) and transfinite induction.

Theorem 3.1. A generalized N-mean U of order m is measurable if and only if 
its kernel u is measurable. 

Proof. It is obvious that the measurability of u implies the measurability of U. From 
now on, we assume that U is measurable and prove that u is measurable. 

This is trivial when $m = N$ and was already noted after (3.1) when $m = 1$. If $m \geq 2$ and $N \geq m + 1$, we proceed by transfinite induction, thereby assuming that the theorem is true if $m$ is replaced with $m - 1$ and, with $m$ being held fixed, if $N$ is replaced with $N - 1$.

In (3.3), choose $\tilde{x}_N$ so that $U(\cdot,\tilde{x}_N)$ is measurable. This is possible since $\tilde{x}_N$ is arbitrary in a co-null subset of $A$. In particular, $G_{N-1,N}(U(\cdot,\tilde{x}_N))$ is measurable (recall that the measurability of the kernel always implies the measurability of the generalized mean) and so $N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U$ is measurable. Since this is a generalized $N$-mean of order $m - 1$ (see the proof of (3.3)), the hypothesis of induction on $m$ ensures that its kernel is measurable. It follows that $N U(\cdot,\tilde{x}_N) - G_{m-1,N-1}(K_{m-1,N}(N G_{N-1,N}(U(\cdot,\tilde{x}_N)) - (N-m) U))$ is measurable. Since this is a generalized $(N-1)$-mean of order $m$ with kernel $(N-m) K_{m,N}(U) = (N-m) u$ (see once again the proof of (3.3)), it follows from the hypothesis of induction on $N$ that $(N-m) u$ is measurable. Thus, $u$ is measurable. □

To complement the measurability discussion, we investigate essential boundedness. In the next theorem, $\|\cdot\|_{A^k,\infty}$ denotes the $L^\infty$ norm on $A^k$.

Theorem 3.2. (i) If $u$ is a measurable function on $A$, its generalized $N$-mean $U := G_{1,N}(u)$ is essentially bounded above (below) if and only if $u$ is essentially bounded above (below). Furthermore, $\operatorname{ess\,sup} U = \operatorname{ess\,sup} u$ and $\operatorname{ess\,inf} U = \operatorname{ess\,inf} u$. In particular, $u \in L^\infty(A; dx)$ if and only if $U \in L^\infty(A^N; d^N x)$ and, if so, $\|U\|_{A^N,\infty} = \|u\|_{A,\infty}$.
(ii) Let $1 \leq m \leq N$ be integers. If $u \in L^\infty(A^m; d^m x)$, then $G_{m,N}(u) \in L^\infty(A^N; d^N x)$ and $\|G_{m,N}(u)\|_{A^N,\infty} \leq \|u\|_{A^m,\infty}$. Conversely, if $U \in L^\infty(A^N; d^N x)$ is a generalized $N$-mean of order $m$, then $K_{m,N}(U) \in L^\infty(A^m; d^m x)$ and there is a constant $C(m,N)$ independent of $U$ such that $\|K_{m,N}(U)\|_{A^m,\infty} \leq C(m,N) \|U\|_{A^N,\infty}$.

Proof. (i) If $a \in \mathbb{R}$ and $u > a$ on a subset $E$ of $A$ of positive $dx$ measure, then $U > a$ on $E^N$, of positive $d^N x$ measure. This shows that $\operatorname{ess\,sup} U \geq \operatorname{ess\,sup} u$. On the other hand, it is obvious that $U \leq \operatorname{ess\,sup} u$ a.e. on $A^N$, so that $\operatorname{ess\,sup} U \leq \operatorname{ess\,sup} u$. This shows that $\operatorname{ess\,sup} U = \operatorname{ess\,sup} u$. Likewise, $\operatorname{ess\,inf} U = \operatorname{ess\,inf} u$.

(ii) The first part (when $u \in L^\infty(A^m; d^m x)$) is straightforward. Suppose now that $U \in L^\infty(A^N; d^N x)$ is a generalized $N$-mean of order $m$. If $m = 1$, the result with $C(1,N) = 1$ follows from (i) and it is trivial when $m = N$ (with $C(m,m) = 1$). To prove the existence of $C(m,N)$ in general, we proceed once again by transfinite induction. The hypothesis of induction is that $m \geq 2$, $N \geq m + 1$ and that the constants $C(m-1,N)$ and $C(m,N-1)$ exist.

In (3.3), choose $\tilde{x}_N$ such that $U(\cdot,\tilde{x}_N)$ is measurable and that $\|U(\cdot,\tilde{x}_N)\|_{A^{N-1},\infty} \leq \|U\|_{A^N,\infty}$ (which holds for a.e. $\tilde{x}_N \in A$). Then, $\|G_{N-1,N}(U(\cdot,\tilde{x}_N))\|_{A^N,\infty} \leq \|U\|_{A^N,\infty}$ and it routinely follows from (3.3) that $C(m,N)$ may be defined by

$C(m,N) := \frac{C(m,N-1)}{N-m} \left( N + (2N-m) C(m-1,N) \right)$.

□
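The constants delivered by this induction are easy to tabulate. The short sketch below simply implements the recursion $C(m,N) = \frac{C(m,N-1)}{N-m}(N + (2N-m)C(m-1,N))$ with the base cases $C(1,N) = C(m,m) = 1$; the function name `C` is chosen to match the text, and nothing here suggests that these values are optimal.

```python
from fractions import Fraction
from functools import lru_cache

@lru_cache(maxsize=None)
def C(m, N):
    """Constant of Theorem 3.2 (ii) produced by the recursion in the proof."""
    if not 1 <= m <= N:
        raise ValueError("need 1 <= m <= N")
    if m == 1 or m == N:
        return Fraction(1)
    return C(m, N - 1) * (N + (2 * N - m) * C(m - 1, N)) / (N - m)

# a small table of the constants for 1 <= m <= N <= 5
table = {(m, N): C(m, N) for N in range(1, 6) for m in range(1, N + 1)}
```

For instance, the recursion gives $C(2,3) = 7$ and $C(2,4) = 35$, so the bounds obtained this way grow quickly with $N$.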
Remark 3.1. In the vector-valued case, part (i) of Theorem 3.2 does not make sense as stated, but $\|U\|_{\infty,A^N} \leq \|u\|_{\infty,A} \leq (2N+1) \|U\|_{\infty,A^N}$ follows at once from (1.1) and (3.1).

Clearly, the result in part (i) that $G_{1,N}(u)$ is essentially bounded above (below) if the same thing is true of $u$ remains true for $G_{m,N}(u)$, but “only if” is false if $m > 1$, as shown in the following example.

Example 3.1. Suppose that $A$ is the countably infinite union of pairwise disjoint subsets with positive $dx$ measure. With no loss of generality, call these subsets $A_i$ with $i \in \mathbb{Z}$ and define $u$ on $A^2$ by setting $u = |i| + |j|$ on $A_i \times A_j$ if $i \neq -j$ and $u = -|j|$ if $i = -j$. Then, $u$ is measurable, symmetric and not essentially bounded below. Yet, $U := G_{2,3}(u) \geq 0$. Indeed, if $U(x_1,x_2,x_3) < 0$, at least one of the pairs $(x_1,x_2)$, $(x_2,x_3)$ or $(x_1,x_3)$ is in $A_{-j} \times A_j$ for some $j \in \mathbb{Z}$. With no loss of generality, suppose that $(x_1,x_3) \in A_{-j} \times A_j$, so that $u(x_1,x_3) = -|j|$. If $x_2 \in A_k$ with $k \neq \pm j$, then $u(x_1,x_2) = u(x_2,x_3) = |j| + |k|$, so that $3U(x_1,x_2,x_3) = |j| + 2|k| > 0$, which is a contradiction. Thus, $k = j$ or $k = -j$. If $k = j$, then $u(x_1,x_2) = -|j|$ and $u(x_2,x_3) = 2|j|$, so that $U(x_1,x_2,x_3) = 0$, another contradiction. Likewise, $U(x_1,x_2,x_3) = 0$ if $k = -j$. Therefore, $U(x_1,x_2,x_3) < 0$ does not happen.
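Since $u$ in Example 3.1 depends only on the indices of the pieces containing its arguments, the claim $G_{2,3}(u) \geq 0$ reduces to a statement about integer triples and can be checked by brute force on a finite window. In the sketch below, the function names and the window `R` are assumptions for illustration, and the index $0$ is allowed.

```python
from itertools import product

def u_index(i, j):
    """Value of u on A_i x A_j: -|j| if i = -j, and |i| + |j| otherwise."""
    return -abs(j) if i == -j else abs(i) + abs(j)

def U_index(i, j, k):
    """Value of G_{2,3}(u) on A_i x A_j x A_k (average over the three pairs)."""
    return (u_index(i, j) + u_index(i, k) + u_index(j, k)) / 3

R = range(-12, 13)
min_U = min(U_index(i, j, k) for i, j, k in product(R, repeat=3))
min_u = min(u_index(i, j) for i, j in product(R, repeat=2))
```

Enlarging the window makes `min_u` arbitrarily negative while `min_U` stays at $0$, in line with the example.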

Since $L^\infty(A^m; d^m x)$ is a Banach space, the next corollary follows at once from Theorem 3.2 (ii).

Corollary 3.3. Let $1 \leq m \leq N$ be integers. Then $G_{m,N}$ is a linear isomorphism of $L^\infty(A^m; d^m x)$ onto the subspace of $L^\infty(A^N; d^N x)$ of generalized $N$-means of order $m$, with inverse $K_{m,N}$. In particular, the space of measurable bounded generalized $N$-means of order $m$ is a closed subspace of $L^\infty(A^N; d^N x)$.

As a by-product of Corollary 3.3, the standard properties of convergence (strong, weak, weak*) for a sequence of measurable bounded generalized $N$-means of order $m$ are equivalent to the corresponding properties for the sequence of kernels.

When all the functions are everywhere finite and (1.1) is always assumed to hold pointwise, not just a.e., the formulas (3.1) and (3.3) hold pointwise and for every choice of $(\tilde{x}_2,\dots,\tilde{x}_N) \in A^{N-1}$ and $(x_1,\dots,x_N) \in A^N$ in (3.1), every choice of $\tilde{x}_N \in A$ in (3.3). This is readily seen by revisiting the arguments justifying these formulas. In particular, no measure on $A$ is needed for their validity. On the other hand, if $A$ is a topological space and $dx$ is a Borel measure, the formulas show, once again by induction, that the Borel measurability of $u$ and $U := G_{m,N}(u)$ are equivalent (recall that every section of a Borel measurable function is Borel measurable). This is not true if (1.1) does not hold pointwise or if the functions are not everywhere defined and finite, because modification on a null set, even a Borel one, need not preserve the Borel measurability of functions.

4. Integrability 

In the previous sections, we saw that the kernel $u$ of a generalized mean $U$ is uniquely determined by $U$ (up to modifications on a null set) and that the measurability and essential boundedness of $U$ and $u$ are equivalent. It is a natural question whether some integrability properties are also transferred. However, it immediately appears that generalized $N$-means cannot have good integrability properties with

^This is always true if $dx$ ($\sigma$-finite) is non atomic.
respect to $d^N x$ when $A$ has infinite measure, regardless of whether the kernel is integrable. Rather than confining attention to the case when $A$ has finite measure, we shall discuss integrability relative to $P d^N x$ where $P$ is a probability density on $A^N$, i.e., $P \in L^1(A^N; d^N x)$, $P \geq 0$ and $\int_{A^N} P \, d^N x = 1$. The last condition is convenient but inessential and can always be achieved by scaling.

On the other hand, it will be important to assume that $P > 0$ (a.e.) and that $P$
is symmetric. That $P > 0$ ensures that the measures $d^Nx$ and $P\,d^Nx$ are
absolutely continuous with respect to one another and, hence, that the null sets are
the same for both measures. The symmetry of $P$ is convenient to formulate the problem
of interest. In general, given $1 \le k \le N$, choose indices
$1 \le i_1 < \cdots < i_k \le N$ and consider the function
$P_{i_1,\dots,i_k}(x_{i_1}, \dots, x_{i_k})$ obtained by integrating $P$ with respect
to all the variables $x_j$ with $j \notin \{i_1, \dots, i_k\}$. By Fubini's theorem,
$P_{i_1,\dots,i_k}$ is a (marginal) probability density on $X^k$. The main point is
that if $P$ is symmetric, $P_{i_1,\dots,i_k}(y_1, \dots, y_k)$ is independent of the
choice of the indices $i_1, \dots, i_k$. It will be called the $k$-variable reduction
$P_{(k)}$ of $P$. Note that $P_{(k)} > 0$ a.e. on $X^k$, that $P_{(k)}$ is symmetric
and that $P_{(\ell)}$ is the $\ell$-variable reduction of $P_{(k)}$ when
$1 \le \ell \le k$.
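The reduction can be checked numerically in a toy setting. The following is only a sketch under stated assumptions, not part of the paper: a three-point set with counting measure stands in for $X$, and the weight function, like the names `weight` and `marginal`, is hypothetical. It illustrates that, for a symmetric density, the marginal does not depend on which indices are kept, and that the reduction of a reduction is again a reduction.

```python
from itertools import product

# Assumption: a three-point set with counting measure stands in for X.
X = range(3)
N = 3

def weight(x):
    # Hypothetical positive weight; it depends only on the sorted tuple,
    # so it is symmetric in its N arguments.
    a, b, c = sorted(x)
    return 1.0 + a + 2.0 * b * c

points = list(product(X, repeat=N))
total = sum(weight(x) for x in points)
P = {x: weight(x) / total for x in points}  # symmetric probability density

def marginal(density, keep):
    """Sum `density` over all variables whose position is not in `keep`."""
    out = {}
    for x, value in density.items():
        key = tuple(x[i] for i in keep)
        out[key] = out.get(key, 0.0) + value
    return out

# Two-variable marginals for three different choices of indices.
P2_a = marginal(P, (0, 1))
P2_b = marginal(P, (1, 2))
P2_c = marginal(P, (0, 2))

# One-variable reduction, computed directly and through P_(2).
P1 = marginal(P, (0,))
P1_via_P2 = marginal(P2_a, (1,))
```

In this toy case the three two-variable marginals coincide, which is the discrete counterpart of the independence of $P_{i_1,\dots,i_k}$ from the indices.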

We observe in passing that the symmetry of $P$ is sufficient, but not necessary,
for the existence of $P_{(k)}$, $1 \le k \le N-1$. Nothing more is needed later but,
for expository purposes, we shall continue to assume that $P$ is symmetric.

The definition of generalized means shows at once that if
$u \in L^1(X^m; P_{(m)}d^mx)$, then $U := G_{m,N}(u) \in L^1(X^N; P\,d^Nx)$ and
$\|U\|_{1,Pd^Nx} \le \|u\|_{1,P_{(m)}d^mx}$. Furthermore,
$\int_{X^N} U P\,d^Nx = \int_{X^m} u P_{(m)}\,d^mx$. Surprisingly, it is generally
false that $U \in L^1(X^N; P\,d^Nx)$ implies $u \in L^1(X^m; P_{(m)}d^mx)$, as shown
in Example 4.1 below. In particular, contrary to intuition, the relation
$\int_{X^N} U P\,d^Nx = \int_{X^m} u P_{(m)}\,d^mx$ need not make sense even though
the left-hand side is defined. This is easy to overlook in formal calculations.
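The two elementary facts just stated (the norm inequality and the equality of integrals) can be illustrated in a finite toy model. The sketch below is not taken from the paper: it assumes a three-point set with counting measure in place of $X$, with $N = 3$ and $m = 2$, and the density weight and the kernel `u` are hypothetical choices made only for illustration.

```python
from itertools import combinations, product
from math import comb

# Assumption: a three-point set with counting measure stands in for X.
X = range(3)
N, m = 3, 2

def weight(x):
    # Hypothetical symmetric positive weight on X^N.
    a, b, c = sorted(x)
    return 1.0 + a + 2.0 * b * c

points = list(product(X, repeat=N))
total = sum(weight(x) for x in points)
P = {x: weight(x) / total for x in points}

# m-variable reduction P_(m): sum out the last N - m variables.
P_m = {}
for x, value in P.items():
    P_m[x[:m]] = P_m.get(x[:m], 0.0) + value

def u(y):
    # Hypothetical symmetric kernel of order m = 2.
    a, b = sorted(y)
    return (a - 1.0) * (b + 0.5)

def G(kernel, x):
    """Generalized mean at x: average of the kernel over all m-subsets of coordinates."""
    subsets = combinations(range(len(x)), m)
    return sum(kernel(tuple(x[i] for i in s)) for s in subsets) / comb(len(x), m)

def norm(f, density, r):
    """L^r norm of f with respect to the given discrete density."""
    return sum(abs(f(x)) ** r * p for x, p in density.items()) ** (1.0 / r)

int_U = sum(G(u, x) * p for x, p in P.items())
int_u = sum(u(y) * p for y, p in P_m.items())
norms = {r: (norm(lambda x: G(u, x), P, r), norm(u, P_m, r)) for r in (1, 2, 3)}
```

Here `int_U` and `int_u` agree up to rounding, and for each `r` the first entry of `norms[r]` does not exceed the second. In a finite model the converse implication is trivial, so this sketch cannot reproduce the failure exhibited in Example 4.1.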

More generally, if $1 \le r < \infty$ and $u \in L^r(X^m; P_{(m)}d^mx)$, then
$G_{m,N}(u) \in L^r(X^N; P\,d^Nx)$ and
$\|G_{m,N}(u)\|_{r,Pd^Nx} \le \|u\|_{r,P_{(m)}d^mx}$, but
$G_{m,N}(u) \in L^r(X^N; P\,d^Nx)$ need not imply $u \in L^r(X^m; P_{(m)}d^mx)$.

Remark 4.1. From the above,
$\{G_{m,N}(u) : u \in L^r(X^m; P_{(m)}d^mx)\} \subset \{G_{m,N}(u) \in L^r(X^N; P\,d^Nx)\}$
and equality need not hold, but we do not know whether the former space is always
dense in the latter. A notable difficulty is that truncation does not preserve the
generalized mean structure.

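^Since $L^\infty(X^m; P_{(m)}d^mx) = L^\infty(X^m; d^mx)$ and
$L^\infty(X^N; P\,d^Nx) = L^\infty(X^N; d^Nx)$, the statement that
$u \in L^r(X^m; P_{(m)}d^mx)$ implies $G_{m,N}(u) \in L^r(X^N; P\,d^Nx)$, with the
norm inequality, is also true when $r = \infty$; see Theorem 3.2 (ii).

Example 4.1. With $X = (0, \infty)$, let $dx$ denote the Lebesgue measure. For
$i, j \in \mathbb{N}$, set $I_i := (i-1, i]$ and $Q_{ij} := I_i \times I_j$. We define
$P$ on $(0, \infty)^2$ by

(4.1) $P = p_{ij}$ on $Q_{ij}$, where $p_{ij} := (i+j)^{-2}$ if $|i-j| = 1$ and $p_{ij} := (i+j)^{-4}$ if $|i-j| \ne 1$.

Then, $P$ is symmetric and $P > 0$. The integrability of $P$ follows from
$\int_{(0,\infty)^2} P\,d^2x = \sum_{i,j} p_{ij} = \sum_{|i-j|=1} p_{ij} + \sum_{|i-j|\ne 1} p_{ij}$.
The first sum is equal to $2\sum_{i=1}^\infty (2i+1)^{-2} < \infty$ and the second one
is majorized by
$\sum_{i,j=1}^\infty (i+j)^{-4} = \sum_{k=2}^\infty \sum_{i+j=k} (i+j)^{-4} = \sum_{k=2}^\infty (k-1)k^{-4} < \infty$.
That $\int_{(0,\infty)^2} P\,d^2x \ne 1$ is of course inconsequential.

Next, let $u$ be given by

(4.2) $u = 2(-1)^i i$ on $I_i$.

Set $U = G_{1,2}(u)$. By (4.2), $U = (-1)^i i + (-1)^j j$ on $Q_{ij}$ and
$|(-1)^i i + (-1)^j j| = 1$ when $|i-j| = 1$, whereas
$|(-1)^i i + (-1)^j j| \le i + j$ in all cases. The integrability of $|U|P$, i.e.,
$U \in L^1((0,\infty)^2; P\,d^2x)$, follows from
$\int_{(0,\infty)^2} |U| P\,d^2x = \sum_{|i-j|=1} p_{ij} + \sum_{|i-j|\ne 1} |(-1)^i i + (-1)^j j|\, p_{ij}$.
As before, the first sum is $2\sum_{i=1}^\infty (2i+1)^{-2} < \infty$ and the second
one is majorized by
$\sum_{i,j=1}^\infty (i+j)^{-3} = \sum_{k=2}^\infty \sum_{i+j=k} (i+j)^{-3} = \sum_{k=2}^\infty (k-1)k^{-3} < \infty$.

Now, $P_{(1)} = \chi_i$ on $I_i$, where $\chi_i := \sum_{j=1}^\infty p_{ij}$. Assume
$i \ge 2$. By (4.1),
$\chi_i = (2i-1)^{-2} + (2i+1)^{-2} + \sum_{j : |i-j|\ne 1} (i+j)^{-4} > (2i+1)^{-2}$
and so, by (4.2), $|u|P_{(1)} = 2i\chi_i > 2i(2i+1)^{-2}$ on $I_i$. Since
$\sum_{i=2}^\infty 2i(2i+1)^{-2} = \infty$, it follows that
$u \notin L^1((0,\infty); P_{(1)}dx)$.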
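The contrast between the convergent and divergent sums in Example 4.1 can be observed numerically. The sketch below is only an illustration with hypothetical function names. Since every square $Q_{ij}$ has area 1, the integrals reduce to double sums, which are truncated here at $i, j \le n$; the truncated value of $\int |u| P_{(1)}\,dx$ is therefore a lower bound for the corresponding partial integral.

```python
def p(i, j):
    """Value of P on the unit square Q_ij, as in (4.1)."""
    return (i + j) ** -2 if abs(i - j) == 1 else (i + j) ** -4

def u(i):
    """Value of the kernel on the interval I_i, as in (4.2)."""
    return 2 * (-1) ** i * i

def partial_sums(n):
    """Truncated sums over 1 <= i, j <= n of P, |U| P and |u| P."""
    mass = abs_U = abs_u = 0.0
    for i in range(1, n + 1):
        for j in range(1, n + 1):
            pij = p(i, j)
            mass += pij
            abs_U += abs(u(i) + u(j)) / 2 * pij  # |U| P with U = G_{1,2}(u)
            abs_u += abs(u(i)) * pij             # |u| P_(1), with P_(1) truncated at j <= n
    return mass, abs_U, abs_u

small = partial_sums(100)
large = partial_sums(400)
```

Between `n = 100` and `n = 400` the first two entries barely move, while the third keeps growing, roughly like a logarithm of `n`, in line with the divergence of $\sum_i 2i(2i+1)^{-2}$.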

In the remainder of this section, we show that, in spite of Example 4.1, a
suitable extra condition on $P$ ensures that a generalized $N$-mean $U$ of order $m$
is in $L^r(X^N; P\,d^Nx)$ with $1 \le r < \infty$ if and only if its kernel
$K_{m,N}(U)$ is in $L^r(X^m; P_{(m)}d^mx)$. This condition is independent of the
measure $dx$ and of $m$ and $N$. Its generality is addressed in the last two theorems.

Once again, we begin with the case m = 1. 

Lemma 4.1. Suppose $N \ge 2$ and that for a.e. $(x_2, \dots, x_N)$ in some measurable
subset $A \subset X^{N-1}$ of positive $d^{N-1}x$ measure, there is a constant
$\Gamma(x_2, \dots, x_N) > 0$ such that

(4.3) $P(\cdot, x_2, \dots, x_N) \ge \Gamma(x_2, \dots, x_N) P_{(1)}$ a.e. on $X$.

Then, there is a constant $\alpha > 0$ such that, for every $r \in [1, \infty)$ and
every $U \in L^r(X^N; P\,d^Nx)$, the set of those $(x_2, \dots, x_N) \in A$ such that
$U(\cdot, x_2, \dots, x_N) \in L^r(X; P_{(1)}dx)$ and
$\|U(\cdot, x_2, \dots, x_N)\|_{r,P_{(1)}dx} \le \alpha^{1/r}\|U\|_{r,Pd^Nx}$ is not a
null set of $X^{N-1}$.

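Proof. The condition (4.3) is equivalent to
$\operatorname{ess\,inf}_{x_1 \in X} P(x_1, \dots, x_N) P_{(1)}(x_1)^{-1} > 0$ for a.e.
$(x_2, \dots, x_N) \in A$. The function $P(x_1, \dots, x_N) P_{(1)}(x_1)^{-1}$ is a.e.
finite on $X^N$ (because $P > 0$ is in $L^1(X^N; d^Nx)$ and $P_{(1)} > 0$) and
measurable. Thus, $\operatorname{ess\,inf}_{x_1 \in X} P(x_1, \cdot) P_{(1)}(x_1)^{-1}$
is a.e. finite and measurable on $X^{N-1}$ (see e.g. [5, p. 271 and p. 275]). As a
result, upon replacing $\Gamma$ with
$\operatorname{ess\,inf}_{x_1 \in X} P(x_1, \cdot) P_{(1)}(x_1)^{-1} \ge \Gamma$, it is
not restrictive to assume that $\Gamma$ is measurable on $A$. If so, the set
$A_\varepsilon := \{(x_2, \dots, x_N) \in A : \Gamma(x_2, \dots, x_N) > \varepsilon\}$
has positive $d^{N-1}x$ measure if $\varepsilon > 0$ is small enough and, by (4.3), we
infer that for a.e. $(x_2, \dots, x_N) \in A_\varepsilon$,

(4.4) $P(\cdot, x_2, \dots, x_N) \ge \varepsilon P_{(1)}$ a.e. on $X$.

Furthermore, the $d^{N-1}x$ measure $|A_\varepsilon|$ of $A_\varepsilon$ is finite.
Indeed, by integrating (4.4) on $X \times A_\varepsilon$, we get
$1 \ge \int_{X \times A_\varepsilon} P\,d^Nx \ge \varepsilon |A_\varepsilon|$, so that
$|A_\varepsilon| \le \varepsilon^{-1}$. We claim that the lemma holds with
$\alpha := |A_\varepsilon|^{-1}\varepsilon^{-1} \ge 1$.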

Let $U \in L^r(X^N; P\,d^Nx)$ be given, where $r \in [1, \infty)$. Since $P > 0$, $U$
is measurable. Hence, $U(\cdot, x_2, \dots, x_N)$ and
$|U(\cdot, x_2, \dots, x_N)|^r P_{(1)}$ are measurable on $X$ for a.e.
$(x_2, \dots, x_N) \in X^{N-1}$. By Fubini's theorem,
$|U(\cdot, x_2, \dots, x_N)|^r P(\cdot, x_2, \dots, x_N) \in L^1(X; dx)$ for a.e.
$(x_2, \dots, x_N) \in X^{N-1}$ and so, by (4.4),

$\int_X |U(x, x_2, \dots, x_N)|^r P_{(1)}(x)\,dx \le \varepsilon^{-1} \int_X |U(x, x_2, \dots, x_N)|^r P(x, x_2, \dots, x_N)\,dx < \infty$,

for a.e. $(x_2, \dots, x_N) \in A_\varepsilon$. This shows that
$U(\cdot, x_2, \dots, x_N) \in L^r(X; P_{(1)}dx)$ for a.e.
$(x_2, \dots, x_N) \in A_\varepsilon$.

By contradiction, suppose now that the inequality
$\|U(\cdot, x_2, \dots, x_N)\|_{r,P_{(1)}dx} \le \alpha^{1/r}\|U\|_{r,Pd^Nx}$ holds only
for $(x_2, \dots, x_N)$ in a null set of $A_\varepsilon$, i.e.,

$|A_\varepsilon|^{-1}\|U\|^r_{r,Pd^Nx} < \varepsilon \int_X |U(x, x_2, \dots, x_N)|^r P_{(1)}(x)\,dx$,

for a.e. $(x_2, \dots, x_N) \in A_\varepsilon$. Together with (4.4), this yields

$|A_\varepsilon|^{-1}\|U\|^r_{r,Pd^Nx} < \int_X |U(x, x_2, \dots, x_N)|^r P(x, x_2, \dots, x_N)\,dx$,

for a.e. $(x_2, \dots, x_N) \in A_\varepsilon$. Then, upon integrating on
$A_\varepsilon$, we find
$\|U\|^r_{r,Pd^Nx} < \int_{X \times A_\varepsilon} |U|^r P\,d^Nx \le \|U\|^r_{r,Pd^Nx}$,
which is absurd. □

^In Lemma 4.1, we need not debate whether the set in question is measurable.

We now solve the integrability question when m = 1. 

Lemma 4.2. If $N \ge 2$, suppose that for a.e. $(x_2, \dots, x_N)$ in some subset
$A \subset X^{N-1}$ of positive $d^{N-1}x$ measure, there is a constant
$\Gamma(x_2, \dots, x_N) > 0$ such that

(4.5) $P(\cdot, x_2, \dots, x_N) \ge \Gamma(x_2, \dots, x_N) P_{(1)}$ a.e. on $X$.

Assume $1 \le r < \infty$ and that $U = G_{1,N}(u)$ for some a.e. finite function $u$
on $X$. Then, $U \in L^r(X^N; P\,d^Nx)$ if and only if $u \in L^r(X; P_{(1)}dx)$. If
so, $\|U\|_{r,Pd^Nx} \le \|u\|_{r,P_{(1)}dx} \le C_r(N,P)\|U\|_{r,Pd^Nx}$ where
$C_r(N,P) > 0$ is a constant independent of $U$ and of $u$.

Proof. Since the sufficiency was already justified before Example 4.1, we only
address the necessity and, with no loss of generality, assume $N \ge 2$. Suppose
$U := G_{1,N}(u) \in L^r(X^N; P\,d^Nx)$. Once again, since $P > 0$, $U$ is measurable
and so, by Theorem 3.1, $u$ is measurable.

Now, recall the formula for $u = K_{1,N}(U)$, in which $(x_2, \dots, x_N)$ is
arbitrarily chosen in a co-null set $S_{N-1}$ of $X^{N-1}$ and $(x_1, \dots, x_N)$ is
arbitrarily chosen in some co-null set of $X^N$. For convenience, rewrite this formula
as

(4.6) $u(x) = N U(x, x_2, \dots, x_N) + c$,

where $c := U(x_1, \dots, x_N) - N G_{1,N}(U(\cdot, x_2, \dots, x_N))(x_1, \dots, x_N)$
is a constant (i.e., independent of $x$). By (4.6), $c$ is uniquely determined by $u$
and by $(x_2, \dots, x_N)$, which shows that $c$ is also independent of
$(x_1, \dots, x_N)$. Since $P$ is a probability density on $X^N$, nonnegative constant
functions on $X^N$ are equal to their $L^r(X^N; P\,d^Nx)$ norm for every $r$. It
follows that $N G_{1,N}(U(\cdot, x_2, \dots, x_N)) \in L^r(X^N; P\,d^Nx)$ and that

(4.7) $|c| \le \|U\|_{r,Pd^Nx} + \|N G_{1,N}(U(\cdot, x_2, \dots, x_N))\|_{r,Pd^Nx}$.
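The identity (4.6) and the constancy of $c$ can be verified directly when everything holds pointwise. The sketch below is not from the paper: it assumes a four-point set in place of $X$, $N = 3$, and an arbitrary hypothetical kernel `u`; the names `NG` and `constant_c` are illustrative only. No density $P$ is involved because (4.6) is a purely algebraic identity.

```python
from itertools import product

# Assumption: a four-point set stands in for X; the kernel values are arbitrary.
X = range(4)
N = 3
u = {0: 1.5, 1: -2.0, 2: 0.25, 3: 7.0}  # hypothetical kernel of order 1

def U(*x):
    """U = G_{1,N}(u): the plain average of u over the coordinates."""
    return sum(u[xi] for xi in x) / len(x)

def NG(f, y):
    """N * G_{1,N}(f)(y) for a one-variable f: the sum of f over the coordinates of y."""
    return sum(f(yj) for yj in y)

def constant_c(tail, y):
    """c = U(y) - N G_{1,N}(U(., tail))(y); `tail` plays the role of (x_2, ..., x_N)."""
    return U(*y) - NG(lambda s: U(s, *tail), y)

tail = (2, 3)
# c should not depend on the point y = (x_1, ..., x_N) at which it is evaluated.
cs = [constant_c(tail, y) for y in product(X, repeat=N)]
c = cs[0]
# Kernel recovered through (4.6).
recovered = {x: N * U(x, *tail) + c for x in X}
```

In this model `c` equals minus the sum of `u` over the entries of `tail`, and `recovered` reproduces `u`.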

We now evaluate $\|N G_{1,N}(U(\cdot, x_2, \dots, x_N))\|_{r,Pd^Nx}$ after making a
suitable choice of $(x_2, \dots, x_N)$. By Lemma 4.1 and the completeness of
$d^{N-1}x$, there is $\alpha > 0$ (independent of $U$ and $r$) such that the subset of
those points $(x_2, \dots, x_N) \in A$ such that
$U(\cdot, x_2, \dots, x_N) \in L^r(X; P_{(1)}dx)$ and
$\|U(\cdot, x_2, \dots, x_N)\|_{r,P_{(1)}dx} \le \alpha^{1/r}\|U\|_{r,Pd^Nx}$ is not
contained in the null set $X^{N-1} \setminus S_{N-1}$. Hence, there is
$(x_2, \dots, x_N) \in S_{N-1}$ such that
$U(\cdot, x_2, \dots, x_N) \in L^r(X; P_{(1)}dx)$ and that

(4.8) $\|U(\cdot, x_2, \dots, x_N)\|_{r,P_{(1)}dx} \le \alpha^{1/r}\|U\|_{r,Pd^Nx}$.

By definition of $G_{1,N}$,

(4.9) $N G_{1,N}(U(\cdot, x_2, \dots, x_N))(x_1, \dots, x_N) = \sum_{j=1}^N U(x_j, x_2, \dots, x_N)$.

We claim that, when $j$ is fixed, $U(x_j, x_2, \dots, x_N)$ is in
$L^r(X^N; P\,d^Nx)$ when viewed as a function of the $N$ variables $x_1, \dots, x_N$
(the arguments $x_2, \dots, x_N$ of $U$ being the fixed values chosen above). Indeed,
by the symmetry of $P$ and (4.8),

$\int_{X^N} |U(x_j, x_2, \dots, x_N)|^r P(x_1, \dots, x_N)\,dx_1 \cdots dx_N = \int_X |U(x_j, x_2, \dots, x_N)|^r P_{(1)}(x_j)\,dx_j \le \alpha \|U\|^r_{r,Pd^Nx}$.

Thus, by (4.9), $N G_{1,N}(U(\cdot, x_2, \dots, x_N))$ is the sum of $N$ functions of
$L^r(X^N; P\,d^Nx)$ with norms bounded by $\alpha^{1/r}\|U\|_{r,Pd^Nx}$. Hence,
$\|N G_{1,N}(U(\cdot, x_2, \dots, x_N))\|_{r,Pd^Nx} \le N\alpha^{1/r}\|U\|_{r,Pd^Nx}$
and so, $|c| \le (1 + N\alpha^{1/r})\|U\|_{r,Pd^Nx}$ by (4.7). Then, by (4.6) and
(4.8), $u \in L^r(X; P_{(1)}dx)$ and
$\|u\|_{r,P_{(1)}dx} \le C_r(N,P)\|U\|_{r,Pd^Nx}$ where
$C_r(N,P) := 2N\alpha^{1/r} + 1$. □


When $m \ge 2$, we first need a lemma similar to Lemma 4.1.

Lemma 4.3. Suppose $N \ge 2$ and that for a.e. $x_N$ in some subset $B \subset X$ of
positive $dx$ measure, there is a constant $\gamma(x_N) > 0$ such that
$P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$ a.e. on $X^{N-1}$. Then, there is a constant
$\beta > 0$ such that, given $U \in L^r(X^N; P\,d^Nx)$ with $r \in [1, \infty)$, the
set of those $x_N \in B$ such that
$U(\cdot, x_N) \in L^r(X^{N-1}; P_{(N-1)}d^{N-1}x)$ and
$\|U(\cdot, x_N)\|_{r,P_{(N-1)}d^{N-1}x} \le \beta^{1/r}\|U\|_{r,Pd^Nx}$ is not a null
set of $X$.

Proof. Modify the proof of Lemma 4.1 in the obvious way. □

In Lemma 4.3, $P(x_1, \dots, x_N) \ge \gamma(x_N) P_{(N-1)}(x_1, \dots, x_{N-1})$ holds
on a co-null subset of $X^{N-1} \times B$, but this is not immediately obvious (a
union of sections of $d^{N-1}x$ measure 0 has $d^Nx$ measure 0 only if it is $d^Nx$
measurable, which is not generally true):

Remark 4.2. The condition $P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$ a.e. on $X^{N-1}$
for a.e. $x_N \in B$ implies the same condition with $\gamma$ replaced with
$\tilde\gamma \ge \gamma$ and $\tilde\gamma$ measurable on $B$ (see the proof of
Lemma 4.1). Since $P - P_{(N-1)} \otimes \tilde\gamma$ is measurable on
$X^{N-1} \times B$, the subset of $X^{N-1} \times B$ where
$P \ge P_{(N-1)} \otimes \tilde\gamma$ is measurable and, by a straightforward
contradiction argument, it is co-null in $X^{N-1} \times B$. Since
$\tilde\gamma \ge \gamma$, it follows that
$P(x_1, \dots, x_N) \ge \gamma(x_N) P_{(N-1)}(x_1, \dots, x_{N-1})$ a.e. on
$X^{N-1} \times B$.

The next lemma is elementary, but crucial to the induction procedure. 

Lemma 4.4. Suppose $N \ge 3$ and that for a.e. $x_N$ in some subset $B \subset X$ of
positive $dx$ measure, there is a constant $\gamma(x_N) > 0$ such that
$P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$ a.e. on $X^{N-1}$. Then, for a.e.
$x_{N-1} \in B$, we have $P_{(N-1)}(\cdot, x_{N-1}) \ge \gamma(x_{N-1}) P_{(N-2)}$ a.e.
on $X^{N-2}$.

Proof. The short version of the proof goes as follows: For every $x_N \in B$ such
that $P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$, integrate this inequality with respect
to $x_1$ to get
$P_{(N-1)}(x_2, \dots, x_N) \ge \gamma(x_N) P_{(N-2)}(x_2, \dots, x_{N-1})$ and change
$x_j$ into $x_{j-1}$ for $2 \le j \le N$. It is easy to justify this procedure
(perhaps for $x_N$ in a smaller co-null subset of $B$) based on Fubini's theorem and
the fact that a.e. section of a co-null subset is co-null. We skip the minor
details. □
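The integration step in this proof can be mirrored in a finite model, where sums replace integrals and no null sets arise. The sketch below is an illustration under assumptions that are not in the paper: a three-point set with counting measure replaces $X$, $N = 3$, $B = X$, the density weight is hypothetical, and $\gamma$ is taken to be the largest admissible constant.

```python
from itertools import product

# Assumption: a three-point set with counting measure stands in for X.
X = range(3)
N = 3

def weight(x):
    # Hypothetical symmetric positive weight on X^3.
    a, b, c = sorted(x)
    return 1.0 + a * b + 0.5 * c

points = list(product(X, repeat=N))
total = sum(weight(x) for x in points)
P = {x: weight(x) / total for x in points}

def reduction(density, k):
    """k-variable reduction: sum out every variable after the first k."""
    out = {}
    for x, value in density.items():
        out[x[:k]] = out.get(x[:k], 0.0) + value
    return out

P2 = reduction(P, 2)
P1 = reduction(P, 1)

# Largest gamma(t) such that P(., t) >= gamma(t) * P_(2) on X^2.
gamma = {t: min(P[(a, b, t)] / P2[(a, b)] for a, b in product(X, repeat=2))
         for t in X}

# Conclusion of Lemma 4.4 for N = 3: P_(2)(., s) >= gamma(s) * P_(1) on X.
slack = min(P2[(a, s)] - gamma[s] * P1[(a,)] for a in X for s in X)
```

A nonnegative `slack` (up to rounding) confirms that the same function `gamma` works one level down, which is what the induction in the proof of Theorem 4.5 relies on.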




Theorem 4.5. If $N \ge 2$, suppose that for a.e. $x_N$ in some subset $B \subset X$ of
positive $dx$ measure, there is a constant $\gamma(x_N) > 0$ such that

(4.10) $P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$ a.e. on $X^{N-1}$.

Let $1 \le m \le N$ be integers and let $1 \le r < \infty$.

(i) If $u \in L^r(X^m; P_{(m)}d^mx)$, then $G_{m,N}(u) \in L^r(X^N; P\,d^Nx)$ and
$\|G_{m,N}(u)\|_{r,Pd^Nx} \le \|u\|_{r,P_{(m)}d^mx}$.

(ii) Conversely, if $U \in L^r(X^N; P\,d^Nx)$ is a generalized $N$-mean of order $m$,
then $K_{m,N}(U) \in L^r(X^m; P_{(m)}d^mx)$ and there is a constant $C_r(m,N,P)$
independent of $U$ such that
$\|K_{m,N}(U)\|_{r,P_{(m)}d^mx} \le C_r(m,N,P)\|U\|_{r,Pd^Nx}$.

Proof. (i) As was observed earlier, this is straightforward (and (4.10) is not needed).

(ii) Since $K_{m,m} = I$, there is nothing to prove when $m = N$ (and
$C_r(m,m,P) = 1$). Accordingly, we henceforth assume $N \ge m+1$. If $m = 1$, the
result follows from Lemma 4.2 with $C_r(1,N,P) = C_r(N,P)$ from that lemma. Indeed, by
inductive application of Lemma 4.4 and Remark 4.2 if $N \ge 3$, or directly from
(4.10) if $N = 2$, the condition (4.5) holds with $A = B^{N-1}$ and
$\Gamma(x_2, \dots, x_N) := \gamma(x_N) \cdots \gamma(x_2)$.

From now on, $m \ge 2$ (and $N \ge m+1$, so that $N \ge 3$). By transfinite
induction, we assume that the result is true if $m$ is replaced with $m-1$ and that
it is also true if $N$ is replaced with $N-1$. Since $N$ cannot be replaced with
$N-1$ without changing $P$, a clarification is in order: What is actually assumed is
that if $N$ is replaced with $N-1$ and if $Q > 0$ is a symmetric probability density
on $X^{N-1}$ satisfying (4.10) with $N$ replaced with $N-1$, there is a constant
$C_r(m, N-1, Q) > 0$ independent of the generalized $(N-1)$-mean
$V \in L^r(X^{N-1}; Q\,d^{N-1}x)$ of order $m$, such that
$K_{m,N-1}(V) \in L^r(X^m; Q_{(m)}d^mx)$ and that
$\|K_{m,N-1}(V)\|_{r,Q_{(m)}d^mx} \le C_r(m, N-1, Q)\|V\|_{r,Qd^{N-1}x}$.

In (3.3), $x_N$ is arbitrary in some co-null subset of $X$ and so, by Lemma 4.3, we
may assume that $U(\cdot, x_N) \in L^r(X^{N-1}; P_{(N-1)}d^{N-1}x)$ and that
$\|U(\cdot, x_N)\|_{r,P_{(N-1)}d^{N-1}x} \le \beta^{1/r}\|U\|_{r,Pd^Nx}$, where
$\beta > 0$ is independent of $U$. With such a choice of $x_N$, it follows that the
generalized $N$-mean of order $m-1$ (see the proof of (3.3))
$N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U$ is in $L^r(X^N; P\,d^Nx)$ and, recalling (i),
that

(4.11) $\|N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U\|_{r,Pd^Nx} \le \left(N\beta^{1/r} + (N-m)\right)\|U\|_{r,Pd^Nx}$.

Thus, we can now use the hypothesis of induction on $m$ to infer that
$K_{m-1,N}\left(N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U\right) \in L^r(X^{m-1}; P_{(m-1)}d^{m-1}x)$
and that the inequality
$\|K_{m-1,N}\left(N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U\right)\|_{r,P_{(m-1)}d^{m-1}x} \le C_r(m-1, N, P)\|N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U\|_{r,Pd^Nx}$
holds. Then, by (i) and (4.11),
$V := N U(\cdot, x_N) - G_{m-1,N-1}\left(K_{m-1,N}\left(N G_{N-1,N}(U(\cdot, x_N)) - (N-m)U\right)\right) \in L^r(X^{N-1}; P_{(N-1)}d^{N-1}x)$
and

(4.12) $\|V\|_{r,P_{(N-1)}d^{N-1}x} \le \left(N\beta^{1/r} + C_r(m-1, N, P)\left(N\beta^{1/r} + (N-m)\right)\right)\|U\|_{r,Pd^Nx}$.

From the proof of (3.3), $V$ above is a generalized $(N-1)$-mean of order $m$ with
kernel $K_{m,N-1}(V) = K_{m,N}(U)$. By Lemma 4.4, $Q = P_{(N-1)}$ satisfies (4.10) and
$Q_{(m)} = P_{(m)}$. Thus, by the hypothesis of induction on $N$, it follows that
$K_{m,N}(U) = K_{m,N-1}(V) \in L^r(X^m; P_{(m)}d^mx)$ and that
$\|K_{m,N-1}(V)\|_{r,P_{(m)}d^mx} \le C_r(m, N-1, P_{(N-1)})\|V\|_{r,P_{(N-1)}d^{N-1}x}$
where $C_r(m, N-1, P_{(N-1)}) > 0$ is a constant independent of $V$. That
$K_{m,N}(U) \in L^r(X^m; P_{(m)}d^mx)$ and
$\|K_{m,N}(U)\|_{r,P_{(m)}d^mx} \le C_r(m, N, P)\|U\|_{r,Pd^Nx}$ thus follows from
(4.12) with

$C_r(m, N, P) := C_r(m, N-1, P_{(N-1)})\left(N\beta^{1/r} + C_r(m-1, N, P)\left(N\beta^{1/r} + (N-m)\right)\right)$. □

The following corollary is the obvious variant of Corollary 3.3.

Corollary 4.6. Let $1 \le m \le N$ be integers and let $1 \le r < \infty$. If
$N \ge 2$, suppose that for a.e. $x_N$ in some subset $B \subset X$ of positive $dx$
measure, there is a constant $\gamma(x_N) > 0$ such that
$P(\cdot, x_N) \ge \gamma(x_N) P_{(N-1)}$ a.e. on $X^{N-1}$. Then $G_{m,N}$ is a linear
isomorphism of $L^r(X^m; P_{(m)}d^mx)$ onto the subspace of $L^r(X^N; P\,d^Nx)$ of
generalized $N$-means of order $m$, with inverse $K_{m,N}$. In particular, the space of
generalized $N$-means of order $m$ in $L^r(X^N; P\,d^Nx)$ is a closed subspace of
$L^r(X^N; P\,d^Nx)$.

We may once again highlight the fact that, by Corollary 4.6, the $L^r$ convergence
properties of sequences of generalized $N$-means of order $m$ are equivalent to
the same properties for the sequences of kernels. A related albeit different issue
(preservation of a product structure under weak convergence in these spaces when
$m = 1$) is discussed in [U Theorem A.3]; condition (4.10) is not needed.

It is obvious that (4.10) holds when $P\,d^Nx$ is a product measure, i.e.
$P = \rho^{\otimes N}$ where $\rho > 0$ is a probability density on $X$ (and then
$\gamma = \rho$). More generally, every symmetric probability density can be
approximated in $L^1(X^N; d^Nx)$ by positive symmetric densities satisfying (4.10):

Theorem 4.7. If $N \ge 2$ and $P$ is a symmetric probability density on $X^N$, there
is a sequence $P_n > 0$ of symmetric probability densities on $X^N$ such that $P_n$
satisfies (4.10) and $P_n \to P$ in $L^1(X^N; d^Nx)$.

Proof. Since $dx$ is $\sigma$-finite, it is a simple exercise to show that there is a
probability density $\rho > 0$ on $X$. For $k \in \mathbb{N}$, set
$Q_k := \min\{P, k\rho^{\otimes N}\} \ge 0$, so that $Q_k \le P$ and $Q_k \to P$ a.e.
By dominated convergence, $Q_k \to P$ in $L^1(X^N; d^Nx)$ and so
$P_k := \|Q_k\|^{-1}_{1,d^Nx} Q_k \ge 0$ is a symmetric probability density and
$P_k \to P$ in $L^1(X^N; d^Nx)$. In addition,
$P_k \le k\|Q_k\|^{-1}_{1,d^Nx}\rho^{\otimes N}$. Thus, upon replacing $P$ with $P_k$,
it suffices to complete the proof under the assumption $P \le C\rho^{\otimes N}$ for
some constant $C > 0$.

For $n \in \mathbb{N}$, define $P_n = n(n+1)^{-1}(P + n^{-1}\rho^{\otimes N})$. Then,
$P_n > 0$ is a symmetric probability density on $X^N$. Also, $P_n \to P$ a.e. and
$P_n \le P + \rho^{\otimes N}$. Thus, by dominated convergence, $P_n \to P$ in
$L^1(X^N; d^Nx)$. We now verify (4.10) for $P_n$. First,
$P_{n,(N-1)} = n(n+1)^{-1}P_{(N-1)} + (n+1)^{-1}\rho^{\otimes(N-1)}$ and so the
assumption $P \le C\rho^{\otimes N}$ entails
$P_{n,(N-1)} \le (n+1)^{-1}(nC+1)\rho^{\otimes(N-1)}$. Next,
$P_n \ge (n+1)^{-1}\rho^{\otimes N}$, so that $P_n$ satisfies (4.10) with $B = X$ and
$\gamma = (nC+1)^{-1}\rho$. □
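The second step of this proof is easy to test in a finite model. The sketch below is only an illustration under assumptions not made in the paper: a four-point set with counting measure replaces $X$, $N = 2$, $\rho$ is uniform, and $P$ is a hypothetical symmetric density vanishing on the diagonal, so that $P$ itself violates (4.10) with this $\gamma$ while the regularized density satisfies it.

```python
# Assumption: a four-point set with counting measure stands in for X, and N = 2.
X = range(4)
rho = [0.25] * 4  # positive probability density on X

# Hypothetical symmetric probability density that vanishes on the diagonal.
raw = [[abs(i - j) for j in X] for i in X]
total = sum(map(sum, raw))
P = [[v / total for v in row] for row in raw]

# Smallest C with P <= C * (rho tensor rho).
C = max(P[i][j] / (rho[i] * rho[j]) for i in X for j in X)

def regularize(density, n):
    """P_n = n/(n+1) * (P + rho tensor rho / n), as in the proof."""
    return [[n / (n + 1) * (density[i][j] + rho[i] * rho[j] / n) for j in X]
            for i in X]

def satisfies_410(density, gamma):
    """Check density(., x_2) >= gamma(x_2) * (one-variable reduction) on X."""
    reduced = [sum(row) for row in density]
    return all(density[i][j] >= gamma[j] * reduced[i] - 1e-15
               for i in X for j in X)

n = 5
P_n = regularize(P, n)
gamma = [r / (n * C + 1) for r in rho]  # gamma = rho / (nC + 1)
```

With these choices `satisfies_410(P_n, gamma)` holds, whereas `satisfies_410(P, gamma)` fails because of the zeros of `P` on the diagonal.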

If $X$ has finite $dx$ measure and attention is confined to probability densities in
$L^\infty(X^N; d^Nx)$, a stronger result is true.

Theorem 4.8. If $X$ has finite $dx$ measure, the set of bounded symmetric probability
densities $P$ on $X^N$ such that $P > 0$ and that $P$ satisfies (4.10) contains an
open and dense subset (for the $L^\infty(X^N; d^Nx)$ topology) of the set
$\mathcal{P}$ of all bounded symmetric probability densities on $X^N$.

Proof. For brevity, we omit a few technical details. For $\ell \in \mathbb{N}$, call
$O_\ell$ the set of those $P \in \mathcal{P}$ such that
$\operatorname{ess\,sup} P < \ell$ and $\operatorname{ess\,inf} P > \ell^{-1}$. Then,
$O_\ell$ is open for the $L^\infty(X^N; d^Nx)$ topology (empty if $\ell = 1$) and, if
$P \in O_\ell$, it is easily verified that $P > 0$ satisfies (4.10) with $B = X$.

If $P \in \mathcal{P}$ and $n \in \mathbb{N}$, set
$Q_n := \max\{P, n^{-1}\} \in L^\infty(X^N; d^Nx)$. Then $Q_n$ is symmetric, $Q_n > 0$
and $Q_n \to P$ in $L^\infty(X^N; d^Nx)$. Since $X$ has finite measure, $Q_n \to P$ in
$L^1(X^N; d^Nx)$ and so $P_n := \|Q_n\|^{-1}_{1,d^Nx} Q_n \in \mathcal{P}$ and
$P_n \to P$ in $L^\infty(X^N; d^Nx)$. Furthermore,
$P_n \in O := \bigcup_{\ell \in \mathbb{N}} O_\ell$ (open) if
$n > \|P\|_{\infty,d^Nx}$, for then $P_n \in O_\ell$ with any
$\ell > (n+1)\max\{\|Q_n\|_{1,d^Nx}, \|Q_n\|^{-1}_{1,d^Nx}\}$. □

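In Theorem 4.8, $\mathcal{P}$ is a Baire space (a closed subset of
$L^\infty(X^N; d^Nx)$) and the set $\mathcal{P}^*$ of those $P \in \mathcal{P}$ such
that $P > 0$ and $P$ satisfies (4.10) is residual in $\mathcal{P}$ in the sense of
Baire category. Thus, from the Baire category point of view, the set
$\mathcal{P} \setminus \mathcal{P}^*$ contains only non-typical elements, but such
non-typical elements may look deceptively innocuous: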

Example 4.2. With $X = [0,1]$ and $dx$ the Lebesgue measure, let $N = 2$ and let
$P(x_1, x_2) := 3|x_1 - x_2|$. Then, $P > 0$ a.e.,
$P_{(1)}(x_1) = 3(x_1^2 - x_1 + \tfrac{1}{2})$ and (4.10) is equivalent to
$|x_1 - x_2| \ge \gamma(x_2)(x_1^2 - x_1 + \tfrac{1}{2})$ for a.e. $x_1 \in [0,1]$,
which does not hold for any $x_2 \in [0,1]$ and any constant $\gamma(x_2) > 0$ since
the (continuous) left-hand side vanishes when $x_1 = x_2$. Thus, (4.10) fails.
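The closed form of $P_{(1)}$ and the vanishing of the ratio $P/P_{(1)}$ on the diagonal can be confirmed numerically. The sketch below is an illustration with hypothetical function names; the midpoint rule and the number of cells are arbitrary choices.

```python
def P(x1, x2):
    """Density of Example 4.2 on [0, 1]^2."""
    return 3.0 * abs(x1 - x2)

def P1_closed_form(x1):
    """One-variable reduction stated in Example 4.2."""
    return 3.0 * (x1 * x1 - x1 + 0.5)

def P1_numeric(x1, cells=2000):
    """Midpoint rule for the integral of P(x1, .) over [0, 1]."""
    h = 1.0 / cells
    return h * sum(P(x1, (k + 0.5) * h) for k in range(cells))

def ratio(x1, x2):
    """P(x1, x2) / P_(1)(x1); a positive lower bound in x1 would give gamma(x2)."""
    return P(x1, x2) / P1_closed_form(x1)

# The ratio tends to 0 as x1 approaches x2, so no positive gamma(x2) exists.
near_diagonal = [ratio(0.7 + d, 0.7) for d in (1e-1, 1e-2, 1e-3, 0.0)]
```

Since $P_{(1)} \ge 3/4$ on $[0,1]$, the ratio is controlled by $|x_1 - x_2|$ alone, which is why the last entry of `near_diagonal` is exactly zero.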

If $P\,d^Nx$ is the probability that $N$ particles have the coordinates
$x_1, \dots, x_N$, respectively, then it is natural to require
$P(x_1, \dots, x_N) = 0$ when $x_i = x_j$ for some $i \ne j$. A short elaboration on
Example 4.2 shows that such densities $P$ do not satisfy (4.10).